Consistent Estimation of Functions of Data Missing Non-Monotonically and Not at Random

نویسنده

  • Ilya Shpitser
چکیده

Missing records are a perennial problem in analysis of complex data of all types, when the target of inference is some function of the full data law. In simple cases, where data is missing at random or completely at random [15], well-known adjustments exist that result in consistent estimators of target quantities. Assumptions underlying these estimators are generally not realistic in practical missing data problems. Unfortunately, consistent estimators in more complex cases where data is missing not at random, and where no ordering on variables induces monotonicity of missingness status are not known in general, with some notable exceptions [13, 18, 16]. In this paper, we propose a general class of consistent estimators for cases where data is missing not at random, and missingness status is non-monotonic. Our estimators, which are generalized inverse probability weighting estimators, make no assumptions on the underlying full data law, but instead place independence restrictions, and certain other fairly mild assumptions, on the distribution of missingness status conditional on the data. The assumptions we place on the distribution of missingness status conditional on the data can be viewed as a version of a conditional Markov random field (MRF) corresponding to a chain graph. Assumptions embedded in our model permit identification from the observed data law, and admit a natural fitting procedure based on the pseudo likelihood approach of [2]. We illustrate our approach with a simple simulation study, and an analysis of risk of premature birth in women in Botswana exposed to highly active anti-retroviral therapy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Bayesian Approach to Estimate Parameters of a Random Coefficient Transition Binary Logistic Model with Non-monotone Missing Pattern and some Sensitivity Analyses

‎A transition binary logistic model with random coefficients is‎ ‎proposed to model the unemployment statues of household members in‎ ‎two seasons of spring and summer‎. ‎Data correspond to the labor‎ ‎force survey performed by Statistical Center of Iran in 2006.‎ ‎This model is introduced to take into account two kinds of‎ ‎correlation in the data one due to the longitudinal nature o...

متن کامل

Marginal Analysis of A Population-Based Genetic Association Study of Quantitative Traits with Incomplete Longitudinal Data

A common study to investigate gene-environment interaction is designed to be longitudinal and population-based. Data arising from longitudinal association studies often contain missing responses. Naive analysis without taking missingness into account may produce invalid inference, especially when the missing data mechanism depends on the response process. To address this issue in the ana...

متن کامل

Random regression models for estimation of covariance functions of growth in Iranian Kurdi sheep

Body weight (BW) records (n=11,659) of 4961 Kurdi sheep from 215 sires and 2085 dams were used to estimate the additive genetic, direct and maternal permanent environmental effects on growth from 1 to 300 days of age. The data were collected from 1993 to 2015 at a breeding station in North Khorasan province; Iran. Genetic parameters for growth traits were estimated using random regression test-...

متن کامل

Transport Property Estimation of Non-Uniform Porous Media

In this work a glass micromodel which its grains and pores are non-uniform in size, shape and distribution is considered as porous medium. A two-dimensional random network model of micromodel with non-uniform pores has been constructed. The non-uniformity of porous model is achieved by assigning parametric distribution functions to pores throat and pores length, which was measured using ima...

متن کامل

Estimation of Variance Components for Body Weight of Moghani Sheep Using B-Spline Random Regression Models

The aim of the present study was the estimation of (co) variance components and genetic parameters for body weight of Moghani sheep, using random regression models based on B-Splines functions. The data set included 9165 body weight records from 60 to 360 days of age from 2811 Moghani sheep, collected between 1994 to 2013 from Jafar-Abad Animal Research and Breeding Institute, Ardabil province,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016